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We investigate intercell interference cancellation (ICIC) with a practical downlink training and uplink 
channel state information (CSI) feedback model. The average downlink throughput for such a 2-cell network 
is derived. The user location has a strong effect on the signal-to-interference ratio (SIR) and the channel 
estimation error. This motivates adaptively switching between traditional (single-cell) beamforming and ICIC 



at low signal-to-noise ratio (SNR) where ICIC is preferred only with low SIR and accurate channel estimation, 
and the use of ICIC with optimized training and feedback at high SNR. For a given channel coherence time 
and fixed training and feedback overheads, we develop optimal data vs. pilot power allocation for CSI training 

> 

\Q ■ as well as optimal feedback resource allocation to feed back CSI of different channels. Both analog and finite- 

O 

rate digital feedback are considered. With analog feedback, the training power optimization provides a more 
significant performance gain than feedback optimization; while conversely for digital feedback, performance 

in' 

is more sensitive to the feedback bit allocation than the training power optimization. We show that even with 
low-rate feedback and standard training, ICIC can transform an interference-limited cellular network into a 
noise-limited one. 



I. INTRODUCTION 

Multicell processing, also called base station coordination or Coordinated Multi-Point (CoMP) 
transmission/reception, is an efficient technique to mitigate intercell interference in multicell networks, 
and it has been attracting significant attention not only from academia [1] but also from industry [2]. 
By coordinating the transmission and reception of multiple base stations (BSs), this technique can in 
principle eliminate intercell interference and transform cellular networks from the familiar interference- 
limited state to a noise-limited one. 

While theoretically appealing, thus far the demonstrated throughput gain of multicell processing in 
realistic simulations and field trials has been largely disappointing, with typical gains in the 5-15% 
range for an LTE-type system [2], [3]. The large losses versus theory are primarily attributed to the cost 
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of overhead in the form of channel state information (CSI) acquisition and feedback and the difficulty 
in maintaining the benefits of CoMP in the face of mobility. The need to support at least moderate 
mobility makes it impossible to amortize these overhead costs over sufficient time periods, as well 
as causing a fundamental limitation due to the delays inherent in exchanging the various overhead 
information, which can easily approach a large fraction of the channel coherence time. This motivates 
directly considering the necessary overhead sources when analyzing multicell processing, optimizing 
and designing the system for it from the beginning, and thus hopefully achieving higher real-world 
gains when those overhead losses inevitably occur. 

Specifically, we consider a particular multicell processing technique for downlink coordination we 
term intercell interference cancellation (ICIC). Opposed to full base station cooperation, ICIC is a 
type of coordinated single-cell transmission, as the signal for each user is transmitted from a single BS 
while the neighboring interfering multi-antenna BSs cancel interference for this user. This means it has 
low demands on the backhaul capacity, which significantly reduces latency as no inter-BS data sharing 
is needed and also makes the system robust to synchronization error among BSs. In our previous work 
[4], we have shown that ICIC can efficiently combat intercell interference and provide performance 
close to that of multicell processing with full inter-BS data sharing. The goal now is to investigate 
the performance of ICIC with a practical CSI model, where the transmit CSI is obtained through 
downlink training and uplink feedback. We provide a unified framework to analyze the ICIC system 
with different CSI assumptions, upon which the training and feedback phases can be optimized. 

A. Related Work 

Initial theoretical studies on multi-cell processing assumed full data sharing between BSs and global 
CSI, which enables global coordination and leads to upper bounds [5]-[7]. To reduce the requirement 
of backhaul capacity and CSI, clustered coordination schemes have been proposed, including intercell 
scheduling [8] or more sophisticated interference cancellation approaches [9]. The impact of limited 
backhaul capacity was investigated in [10], [11]. In [12], [13], distributed BS coordination strategies 
were proposed based on the virtual signal-to-interference-and-noise (SINR) framework. 

The presumed need for a large amount of CSI is a major obstacle for multicell processing. In [14], 
[15], it was shown that when the CSI overhead is actually taken into account, conventional cellular 
architectures with no BS coordination are still quite attractive relative to the supposedly superior 
multicell processing approaches just mentioned. The impact of CSI estimation error in multicell 
processing was analyzed in [16], while [17], [18] proposed limited feedback techniques for coor- 
dinating BSs. These results demonstrated the importance of considering CSI overhead/accuracy, but 
an accurate characterization of multicell processing systems with both CSI training and feedback, and 
the corresponding performance ODtimization. are not vet available. 



3 



A related body of research exists for single-cell multiuser MIMO (multiple-input multiple-output) 
systems, since multicell processing can be regarded as a distributed multiuser MIMO system. Downlink 
multiuser MIMO requires transmit CSI, and extensive studies have been done on CSI acquisition 
and optimization for multiuser MIMO. Limited feedback, which provides quantized CSI to the BS 
and has been successfully implemented in single-user MIMO, has recently received lots of attention 
in the context of multiuser MIMO [19], [20]. Limited feedback combined with opportunistic user 
scheduling was investigated in [21], [22]. In [23], [24], the multiuser MIMO system with both delayed 
and quantized CSI was studied, which shows that single-user MIMO may in fact be preferred over 
multiuser MIMO with imperfect CSI. In [25] a comprehensive study of the MIMO broadcast channel 
with zero-forcing (ZF) precoding considered downlink training and explicit channel feedback, with such 
overhead optimized in [26]. In [27], a similar optimization problem was studied for a beamforming 
system with limited feedback. Though ICIC bears similarities with multiuser MIMO, the multicell 
aspect makes CSI training and feedback more decentralized and hence challenging. 

One important property of ICIC is that the CSI possessed by the home and neighboring interfering 
BSs are related to the information signal and interfering signals, respectively. This yields a few 
important consequences and opportunities. First, the channels from the home and neighboring BSs 
are statistically independent, which makes the performance easier to analyze. Second, these channels 
have different path losses, which means the user location will strongly affect the SIR and the quality of 
channel estimation. Third, the different roles of CSI at the home (related to the information signal) and 
neighboring BSs (related to interfering signals) motivate adaptively allocating the uplink resource to 
feed back different channels. Previous works on training and feedback optimization for beamforming 
and multiuser MIMO in [26], [27] focused on the overhead optimization, which cannot be easily 
implemented in current systems without significantly altering the standards. In this paper, we consider 
pilot vs. data power allocation in the downlink training and feedback resource allocation (i.e., feedback 
symbol power allocation for analogy feedback and feedback bit allocation for digital feedback) in the 
uplink, which is more amenable to implementation. 

B. Contributions 

In this paper, we investigate the performance of ICIC in a 2-cell network, explicitly considering the 
overheads of CSI training and feedback. The main contributions are as follows. 

Throughput analysis. We provide a unified framework to evaluate the average achievable throughput 
with single-cell beamforming or ICIC at each BS. It shows that the user location has a significant effect, 
since it affects SIR and the quality of channel estimation. As canceling interference for neighboring 
users will reduce the signal power for the home user and in addition imperfect CSI degrades the 
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performance, ICIC is not always preferred especially at low SNR. Adaptive strategies are required to 
improve the system throughput: at low SNR, adaptively switching between single-cell beamforming 
and ICIC is needed, and ICIC is preferred only when the CSI estimation is accurate and SIR is low; 
at high SNR, training and feedback can be optimized based on the user locations, fading block length, 
and the average receive SNR. 

Training and feedback optimization. For training optimization, we consider pilot vs. data power 
allocation, with a tradeoff between the channel estimation accuracy and the downlink throughput. For 
analog feedback, we optimize the power allocated to the feedback for different BS channels, while for 
digital feedback, we optimize the number of feedback bits allocated to different BS channels, both of 
which achieve a balance between the CSI accuracy of the information and interference signals. The 
training optimization, since it is broadcast, is common to all users while the feedback optimization is 
performed individually by each user. 

• Training Optimization: We first provide a sufficient condition under which there is no benefit 
to optimize the training overhead and simply optimizing the pilot vs. data power tradeoff is 
adequate. The optimal training power tradeoff is then found, which shows that with block length 
T, and N B BSs each with N t antennas, the power allocated to each pilot symbol is proportional 
to \JrjN~t/ Nb- 

• Feedback Optimization: For periodic analog feedback, it is shown that downlink power tradeoff 
optimization provides a more significant performance gain than uplink feedback power optimiza- 
tion. Alternatively, for finite-rate digital feedback, the uplink feedback bit allocation is more 
important than training optimization. 

The performance gain of training and feedback optimization is demonstrated by simulation, which 
shows that ICIC with CSI training and feedback provides significant average and edge throughput gains 
over conventional single-cell beamforming. Specifically, with proportional uplink/downlink transmit 
power (for analog feedback) or with feedback bits proportional to the uplink channel capacity (for 
digital feedback), ICIC with standard training and feedback provides performance approaching that 
with perfect CSI, and successfully transforms an interference-limited cellular network into a noise- 
limited one, even accounting for the necessary overhead. 

II. System Model 

A. Signal Model 

We consider a multicell network with N B BSs each with N t antennas, and there is one active 
single-antenna user in each cell. Universal frequency reuse is assumed. The objective is to design an 
efficient multicell processing strategy to suppress intercell interference. To retain tractability, we will 
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focus on the 2-cell network as shown in Fig. [T] The BS and user in the i-th cell are indexed by i, 
while the BS and user in the other cell are indexed by i = mod(i, 2) + 1 for i = 1, 2. 

We focus on the downlink transmission. For the data symbol transmission, the discrete baseband 
signal received at the i-th user (i — 1, 2) is given as 



yi = y/PdLi^&Xi + y/ PdLijhZjfiX-i + z h (1) 

where a* is the conjugate transpose of a vector a and 

• Xi is the transmit signal from the i-th BS for the i-th user, with the power constraint E[|a;j| 2 ] = 1. 

• Zi is the complex white Gaussian noise with zero mean and unit variance, i.e., z % ~ CA^(0, 1). 

• Pa is the transmit power for data symbols and Li j is the pathloss given by Li j = r] (D /dij) a , 
where D is the reference distance, r] is a unities s constant that depends on the antenna charac- 
teristics, and dij is the distance between user i and BS j. In the following, we set D = R, so 
r]Pd is the average received SNR at the cell edge. 

• hj j is the N t x 1 channel vector from the BS j to user i, where each component is i.i.d. £A/"(0, 1). 
We consider a block fading model, where the channel is constant over each block of length T 
and is independent for different blocks. 

The vector fj is the precoding vector at BS i, i — 1,2. It is normalized and is designed based on 
the available CSI at the BS. Although the main focus is on ICIC (with the precoder denoted as fjjcX 
we also consider conventional single-cell beamforming (with the precoder denoted as f^BF): 

• Single-cell Eigen-beamforming: The precoding vector is the channel direction, i.e., for the i-th 
user f i>B F = h^j/Uhj jU. Therefore, the signal term is distributed as |f* BF ]fi i;i | 2 ~ X2N t > where 
xi denotes the chi-square random variable with n degrees of freedom. When applying eigen- 
beamforming in multicell networks, each user will suffer interference from other cells. 

• Intercell Interference Cancellation (ICIC): Taking cell 1 as an example, to cancel its interference 
for users in cell 2, 3, • ■ ■ , N B , (N B < N t ), and also to maximize the desired signal power 
|fi Ic hi ; i | 2 , the precoding vector fi ic is chosen in the direction of the projection of vector h^i 
on the nullspace of vectors H = [h 2j i, h 3 i, • • • , hjv s ,i] [28], i.e., the precoding vector is the 
normalized version of the vector wf = ^1 — H ^H*H^ H*^ hi i. From [28], we have the 
distribution of the signal power as |fj IC ]fii 5 i | 2 ~ x\iN t -(K-i)y N° te that the precoder design only 
requires local CSI, i.e., BS k only needs hj j with j = k, and i — 1, 2. 

Assuming user j selects the transmission strategy Sj G {IC,BF}, j = 1,2, the receive signal-to- 
interference-plus-noise ratio (SINR) for user i is 



sinr,( Si , s 2 ) = - l0 . (2) 



Pd,Li : i 


h* f 


2 


1 + PdLij\ 


h*.f- s _|2 

1,1 1 
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The SINR not only depends on the transmission strategy pair, but also depends on the available CSI, 
which affects the precoder design. 

B. The CSI Model 

To enable transmit precoding, downlink CSI is required at the BS. In TDD (Time Division Duplex) 
systems with channel reciprocity, transmit CSI can be obtained through uplink training. For systems 
without channel reciprocity, such as FDD (Frequency Division Duplex) systems, downlink training 
and uplink feedback are normally applied to provide transmit CSI, which is our focus in the paper. 
Each fading block of length T is divided into three phases: a downlink training phase of T t channel 
uses, an uplink feedback phase of Tjb channel uses, and the data transmission phase of Td channel 
uses. CSI training is discussed in this subsection, while feedback will be treated in Section HVl 

The CSI at each user is obtained through downlink training with dedicated pilot symbols. We 
consider orthogonal training, where the training phase spans T t (T t > NsNt) channel uses, using 
orthogonal training sequences {0 O , 4>N B N t -i}, with fa G C TtXl . The set of training sequences 
is partitioned into N B disjoint groups each with N t sequences, denoted as $j for the ith BS, i = 
1, 2, ... , N B . The power scaling factor is yj jfcPt so the transmit power for the pilot symbols from 
each BS is T t P t , which sets the power constraint for each pilot symbol to be P t . For simplicity, we 
normalize T and T t as T = and T t = . 

Different from conventional single-cell processing systems, we assume that each user estimates CSI 
from both its home BS and the neighboring BS. The user i estimates the channel from BS j based 
on the observation 

Sjj = yj T t P t L id hij + z i5 i, j = 1, 2, (3) 

corresponding to the common training channel output, where Zj ~ CAf(0, IjvJ- If i = j, corresponds 
to the pilot from the home BS; otherwise, it corresponds to the pilot from the neighboring BS. While 
the channel estimation quality of the home BS determines the information signal power, the channel 
estimation of neighboring cells determines the residual intercell interference. Therefore, the training 
design for multicell processing is quite different from conventional single-cell processing systems. 
The minimum mean square error (MMSE) estimate of hjj given the observation Sjj is [29] 

jTtPtLu 

The channel h^ j can be written in terms of the estimate hij and the estimation noise ny as 

hij = \j + n id , i,j = 1, 2. (5) 
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With the MMSE estimator [29], n^- is independent of the estimate and is zero-mean Gaussian with 
covariance afjl Nt with afj = 1+TtPtL . . , while hy is with covariance «fjljv f , = 1 - ofj. 

Considering the two downlink transmit phases, i.e., the training phase and the data transmission 
phase, we have the following constraints: 

T t + T d = T — T fb , P t T t + P d T d = P d \T - T fb ), (6) 

where T d = j^, Tfb = and P dl is the average power constraint in the downlink. The training 
optimization with these constraints will be investigated in Section |nij 

With the receive SINR given in ©, treating intercell interference as additive white Gaussian noise, 
we are interested in the following average achievable throughput 

Riis!, s 2 ) = E [log 2 (1 + SINR,)] , % = 1, 2. (7) 

Remark 1: To achieve this throughput, an additional dedicated training round is needed after the 
uplink feedback phase for each user to get its received SINR value [25]. For convenience, we ignore 
this dedicated training round and assume there is a genie who provides each user the actual SINR, 
and this genie-aided throughput will be used as the performance metric throughout the paper. As the 
capacity of this kind of interference channel is unknown even with perfect CSI, our focus is on the 
achievable throughput with specific training and feedback methods. 

Considering the overhead due to training and feedback, the effective achievable throughput is 

Ri( Sl , s 2 ) = (l - Tt t- T/6 j IU{8i, s 2 ), i = l,2. (8) 

We will use Ri, -R^Tj -R«,aFB> an d -R^dFB to denote the throughput of user i with perfect CSI, downlink 
training, training and analog feedback, and training and digital feedback, respectively. 

C. Auxiliary Results 

This subsection provides useful results that are used for throughput analysis throughout the paper. 
The average achievable throughput in © can be in general expressed as E[log 2 (l+X)] with X = jry, 
where Z and Y are the signal power and the interference power, respectively. With independent 
Rayleigh fading channels, the following results from [4] can be used for throughput analysis for 
different systems considered in the paper. 

Lemma 1: Consider the random variable X = f^y, where a > 0, and Z and Y are independent. 

1) If Z ~ x\m and Y = 0, then 

M—l 

KU(a, M) = E x [log 2 (1 + X)} = log^e 1 /" £ r( ~^ 1/a) . (9) 

fc=0 
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2) If Z ~ x 2 ^, y ~ /? ■ Xi then 

AT-1 i 



m—x t 1+1 — i / -I \ 

fc« (a, /3, M) ^ E x [log 2 (l + X)] = log 2 (e) £ £ - - ■ h |, i, J + lj , (10) 



i=0 1=0 

"GO —m^ — ax 



where Ii(a,b,m,n) = / °° faf^iT^ij dz, with a closed-form expression given in [4]. 
3) If Z ~ ^ = /^i^i + ^2^2 with Yi ~ xl' ^2 ~ xl» an ^ tne y are mutually independent, 



K®(a,0i,0 2 ,M)±E x [log 2 (l+X)] 



a 



, (1 , (A-A)(<-0l 



7i ( - ,— ,i,Z + l - Ji - ,— ,i,Z + l 



a fix J \a j3- 



>2 



(ID 



III. ICIC with CSI Training (T t > 0,T fb = 0) 

In this section, we consider the 2-cell network with downlink CSI training and assume that the BS 
has direct access to the CSI estimated at the user. The results developed following this assumption 
can be applied to the TDD system where the transmit CSI can be obtained through uplink training, 
and they will also be applied to the system with both training and feedback in Section HVl 

A. Throughput Analysis and Adaptive Transmission 

One unique property of wireless networks is the spatial distribution of different nodes, which causes 
multiple order of magnitude fluctuation in the information signal power and the interference level. 
Three typical scenarios of user locations in the 2-cell network are shown in Fig. Q] As shown in [4], 
concerning the sum throughput, it is not always optimal to apply ICIC, especially for low to medium 
receive SNRs (e.g., for both users in scenario (b), or user 2 in scenario (c) in Fig. [D, as cancelling 
interference for the neighboring user will reduce the signal power for the home user. Therefore, it 
is necessary to switch between single-cell beamforming and ICIC based on user locations. When 
considering CSI training, as the pilot symbols received from the home BS and the neighboring BS 
come from different propagation paths, the user location now will also affect the accuracy of the 
estimation for different channels. 

For adaptive transmission strategy selection, we first derive the average achievable throughput, given 
in Proposition [7J 

Proposition 1: The average achievable throughput of user % (i = 1,2) with estimated CSI is 
approximated by 

' KW^PiL^, P d L Ql N t ) {s h si) = (BF, BF) 

K®(KliPdLi,i, ^-P d La, N t ) { 8i , = (BF, IC) 



Rlt{sx, s 2 ) 



(12) 

rc< 2 >(«? f P dJ L M , a^P d L^ N t - 1) (s h sj) = (IC, IC) 
TZ^UlPdLu, P d L tJ , N t - 1) ( Si , Si) = (IC, BF) 
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where TZ^ is given in (flOl) . 

Proof: See Appendix [A] ■ 
To maximize the sum throughput, the preferred transmission strategy pair is then determined as 

0*, s 2 ) = arg max Ri,t(si, s 2 ) + -R2,t(si, s 2 ). (13) 

si,s 2 e{BF,IC} 

Fig. [2] shows the simulation and calculation results with perfect CSI and CSI training, where user 
1 is at the location (-0.1R, 0) in Fig. [TJ The results for perfect CSI come from [4]. It shows that the 
approximation (fT2l) is accurate and the estimated CSI degrades the performance. Specifically, with CSI 
training, when user 2 is in the interior (near (R, 0)), there is little or no performance gain for user 1 
to perform ICIC for user 2, which is quite different from the perfect CSI case. This is because when 
user 2 is close to BS 2, the received pilot signal power from BS 1 becomes weak, which reduces the 
estimation accuracy and results a high level of residual interference from BS 1. 

In Fig. |3l we plot the preferred transmission strategy pairs for different user locations, with edge 
SNR 4 dB. The preferred strategy pair is determined by (fl3V Comparing the system with perfect 
CSI and with CSI training, the operating region of each strategy pair changes significantly. With CSI 
training, the strategy IC is picked at a given BS only when the neighboring user is very close to the 
cell edge, where the CSI estimation is accurate and the performance gain from ICIC is high. 

B. High SNR Performance and Training Optimization 

In this subsection, we investigate the high-SNR performance of ICIC. While adaptive transmission is 
required at low SNR, ICIC is always preferred at high SNRs due to its capability to suppress intercell 
interference, as the multicell network is interference-limited at high SNR. 

With perfect CSI, intercell interference is completely cancelled, and the average achievable through- 
put for the 2-th user (i = 1, 2) at high SNR can be approximated as R4 w E log 2 f^iji^dX^jvi -1)) 
where xln denotes a chi-square random variable with 2n degrees of freedom. As E [logx 2n ] = x P( n 



ni 



where ip(-) is Euler's digamma function that satisfies ijj(m) = ^(1) + YmLi 7 f° r positive integers 
and ^(1) « -0.577215, we have 

Ri « log 2 P d L hi + log 2 e ■ V(iV t - 1) = log 2 (P d L iti e^ N ^) , (14) 

which depends on the user location and P d . 

When CSI is obtained through downlink training, the channel estimation error will degrade the 
system performance. The following lemma shows the impact of CSI training at high SNR. 

Lemma 2 (Throughput loss due to training): At high SNR (assuming P t ,Pd — > 00 with ^ = u), 
the throughput loss of ICIC due to CSI training for user i (i = 1, 2) is 



AR hT = R i - R i<T w E 



log(l + ^)l=^(f l), (15) 
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where is given in ©. 

Proof: See Appendix [Bj ■ 

The throughput loss due to training is a constant at high SNR and depends on the ratio of v and T t . 
It decreases as v decreases (more power is allocated to training symbols) and/or the training period 
T t increases. For example, if P d = P t and T t = N B with N B = 2, then AR iyT ns R^(l/2, 1) « 
0.52 bps/Hz. This rate loss is negligible at high SNR. 

Based on the throughput loss and applying Jensen's inequality (=-, lj < log 2 ^1 + =M, we 
can obtain a high-SNR approximation for the achievable throughput with CSI training for user i 

R i>T « Rt - log 2 1 + -± =-)= log 2 l f (16) 
V PtTj P d l + T t Pi 1 

From this expression, the achievable throughput with CSI training depends on the power allocation 

between P t and P d , and the training overhead T t , which motivates to optimize downlink training to 

improve the performance. 

In general, the training optimization involves optimizing both P t and T t . We first consider optimizing 

P t . We assume different BSs will have the same P d and P t , which is desirable for practical system 

implementation. Considering user i (i = 1,2), the training power allocation problem is (P*,P^) = 

arg m&X7p t p t+ 7p dPd= 7r pd i R^t. With the approximation (fTol . the problem is reformulated as 

(Pt,PZ) = arg_ min _ ±- + -L. (17) 

T t Pt+T d P d =TP P d T t P t 

From this formulation, the training power allocation is the same for different BSs and is independent 
of user locations. This fits the practical requirement, as each BS transmits the common pilot symbols 
for all users in the cell. 

The minimization problem (fTTT) is a convex optimization problem, and following the KKT (Karush- 
Kuhn-Tucker) condition we can get the solution as 

pdl rppdl 

Pd= ==■ == — -, Pi = =r-, = — :• (18) 

Next, we consider optimizing both P t and T t (Nb < T t < T). For a given T t , P* is obtained as 
in ([TBI . With the high-SNR approximation (fT6l) . and substituting P* and P%, we have the following 
approximation for the equivalent throughput considering training overhead 

which is concave in T t , so the optimal length T* (Nb <T t <T) can be found by a line search, e.g., 
using the bisection method. However, the solution will depend on the location of user i through L^. 
This means that we cannot simultaneously optimize T t for users at different locations, which conflicts 
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the common pilot design rule. Fortunately, this will not cause an issue, as the optimal T t is obtained 
at its minimum possible value Nb under a mild condition, stated in the following lemma. 
Lemma 3: A sufficient condition for T t = Nb is 



P dl L u > exp 

'T^Nb + I 
Proof: See Appendix 




Xu +2io g Ji-^ + l \-w-v 



1,2. (20) 



Remark 2: This result shows that pilot/data power allocation is sufficient for training optimization 
under condition (|20l) . and there is no need to optimize T t . 

1) This is a condition on the average receive SNR at each user. If the average edge SNR is greater 
than the threshold, then all the users in the cell satisfy this condition. 

2) The threshold at the right-hand side of (l20l depends on Nb, N t , and T, and it is a decreasing 
function of Nb and N t . This condition is easy to satisfy. For example, if T = 100, Nb = 2, we 
need P dl L tl > 0.87 dB for N t = 4, and P dl Lu > -3.24 dB for N t = 8. 

3) A similar result is also shown in [30] for single-cell multiuser MIMO systems. In fact, for a 
single-user MIMO system, there is never a need to optimize T t once P t is optimized [31]. 

As condition (1201) is easy to satisfy, we will only consider pilot power optimization in the following, 
and assume T t = Nb. When T is very large, for fixed T t , we have Td — > T, and from (fl"8l we have 



p* ^ pai^ p* yLLp dl _ So the power allocated to training symbols is proportional to i.e., for 
long blocks, more power will be allocated to each training symbol. 

IV. ICIC with CSI Training and Feedback (T t > 0,T /6 > 0) 

In this section, we consider the CSI model with both training and feedback. Two types of feedback 
are discussed: analog feedback and digital feedback. For each feedback type, we first derive the average 
achievable throughput, and then optimize the training and feedback phases. 

A. Training with Analog Feedback 

We first consider analog feedback where the estimated CSI at each user is fed back to the BS using 
unquantized and uncoded QAM [32], [33]. The uplink feedback channel is assumed to be an unfaded 
AWGN channel as in [26], [30]. As each user needs to feed back CSI for Nb BSs, we divide the 
feedback block Tf b into N B equal-length sub-blocks. During the jth sub-block, at user i, i — 1, 2, the 
estimated CSI, or equivalently a scaled version of the received training vector Sjj (j = 1,2), given 
in ©, is modulated by a jj^ x Nt unitary spreading matrix with the power constraint j^Pfb,ij [32]. 
We assume orthogonal feedback, so ^ > NsN t , i.e., Tfb > N B N t . Although the feedback can be 
received by both BSs, the home BS i is responsible for the final channel estimation as it is closer 
to user i. i.e.. BS i will estimate both h, ^ and h, o and will Dass the estimation to the neighboring 
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BS over the backhaul link. In the following discussion, we fix Tj b and focus on the power allocation 
{Pfb,n,Pfb,i2), with the constraint 
T TV 

-Tf-Pfb,il + -T-j—Pfb,i2 = TfbP U Or Pfb,il + Pfb,i2 = P Nb, (21) 

where P ul is the uplink transmit power constraint. 

As the uplink channel is modeled as an unfaded AWGN channel with pathloss, the received feedback 
vector at the zth BS after de-spreading is 



T fb p J Tfb p T 

\ r.' l' h 'J 1 '- . /Mj M i >-rnr i 1 

- , _ Sij + w itj = _ ( V TtPtLijhij + Zij ) + w ii3 

'l + TtPtLij Jl + TtPLij 



j^Pfb,ijLi : iT t P t Lij 



1 + T t P t L M 



hij + Wij, j = l,2, (22) 



where w$j is the equivalent noise, and 3 ~ CA/YO, of -) with a? = rrrprh ~ + 1. If i = j, the 
feedback is for the home BS channel, which determines the signal power; if % ^ j, the feedback is 
for the neighboring BS channel, which is related to the interference level. This motivates the feedback 
power allocation with the constraint (|2TT) . 
The MMSE estimate of the channel vector is 



±fb p T rp pj 

N R fb,ij- L 'i,i J - t* 



= j _ 7- (23) 

'l + TtPtLtj {^Pfbhi + \ j 



Then the actual channel vector hjj can be written as h^- = hjj + e$j, where hjj and e^- are 

rp p p f b p ■ ■ \j ■ 

independent with variances kf ■ = '' J , Nb Ttu '' J — r and of,- = 1 — k 2 - ■„ respectively. 

The precoding vectors are designed assuming that hjj = 1,2) are the actual CSI. As the 
distribution of is similar to hjj with different variances, following the same derivation for the 
case with training only, we can get the following proposition. 

Proposition 2: The average achievable throughput of user i with training and analog feedback is 
approximated as 

' n^(klP d L hl , P d L ih N t ) {s i} St) = (BF, BF) 

n^(kl t P d L ht , cr^PaL^, N t ) ( Si , in) = (BF, IC) 



Ph,hFb(si, s 2 ) 



(24) 

n^(klP d L hlJ a^P d L i7h N t - 1) (s u si) = (IC, IC) 



n^{k%P d L^, P d L ith N t - 1) (s h si) = (IC, BF) 
where TZ^ is given in (ITOb . 

This result can be used for adaptive transmission strategy selection. 
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In the following, we will optimize both training (optimizing (P d , P t )) and feedback (optimizing 
(Pfb,n, Pfb,i2) for i = 1,2). We assume that Tfb is fixed, as the modification of Tfb will affect the 
uplink traffic channel while our discussion focuses on the downlink transmission. A systematic design 
of both downlink and uplink transmissions is beyond the scope of this paper. 

1 ) Training Optimization: We first consider training optimization. At high SNR, similar to Lemma 
El the rate loss due to training and analog feedback can be approximated as 

Ri - Rt^B « log 2 f 1 + L i7i P d I 1 + =-L- - J J , (25) 

which is a constant rate loss if § and are constants. 

Substituting (fl4l) . we get the following high-SNR approximation for -Rj^fb 

#i,aFB ~ log 2 = — T . (26) 

For given Tfb and Pfb,u, the throughput maximization problem is equivalent to 

1 1 

_ _ min_ _ h = — , (27) 

TtPt+T d P d ={T-T fb )P dl Pd T t P t 



which is independent of the feedback allocation. This problem is the same as (1181) . and we get the 
following results 

PS= %T T ^ pdl , jy-gzjgg. (28) 

vXi(vXi + i) ' T t (VTd + l) 
The solution depends only on the intervals of different transmission phases, i.e., T t , T d , and Tfb. 

2) Feedback Optimization: Next, we consider feedback optimization, i.e., optimizing (Pfb,a, Pfb,i2) 

for i = 1,2. Note that the uplink feedback optimization is done individually for each user, while 

the downlink training optimization is the same for all users. The feedback optimization is over the 

following approximation for the average SINR 

P d L hl kl(N t -l) 



SINR^ ' ^ , 2 '- , i = l,2. (29) 
1 + PdLaafj 



This is reasonable as log 2 (l + SINRj) gives an upper bound on the average achievable rate for user i. 
From (l29l) . the feedback power allocation problem at user % can be stated as 



Increasing Pfb,u will increase in the signal term and also increase afj in the interference term, 
and feedback power allocation will balance these different effects. 

Denote x = j^Pfb,aLij, a = T t P t L iy i, b = PdL^, and p = Tf b P ul L i:i , then problem (|30T) is 
equivalent to 

X 
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Denote Ai = 1 + p, A 2 — 1+ Q a b +fe , the objective function can be rewritten as 

1 + a f 1 
l+a+&\ 1 + Ai + A 2 

So the maximization problem is equivalent to max < x < p f{x) with f(x) = ^ztxT+^ ~ ^+1^ • > ^' 

A 2 > 0, and Ai = p + 1 > x, the first and second terms are both concave, so the objective function is 
concave. Setting = 0, we have 

[(1 + Ai) - (Ax + A 2 )A 2 ] x 2 - 2(Ai + A 2 )(l + Ai + A 2 )x + Ai(Ai + A 2 )(l + Ai + A 2 ) = 0. (33) 

Denote ( the solution pair of (l33l . if x\ G [0,p], i — 1,2, then it is the solution for the 

original problem; otherwise, the maximal value is obtained at the edge and x* = p is the solution, as 
x = makes the objective function to be which is obviously not the maximum. 



(Ai + A 2 )A 2 1 + Ai 



x 



(Ai + A 2 



x + 1 



(32) 



B. Training with Digital Feedback 

In this part, we consider digital feedback, also called finite-rate or limited feedback [19], [20], which 
feeds back quantized CSI. We assume user i (i — 1, 2) feeds back a total of £>,; bits, among which Bn 
bits is for the channel estimate h^i of BS 1 and B i2 bits for the channel estimate h i 2 of BS 2. The 
feedback channel is assumed to be error-free and without delay. The feedback interval is Tf b = piBi, 
where pt, is a conversion factor that relates bits to symbols, e.g., p, = 1 for BPSK feedback. 

The channel estimate hjj, i,j = 1, 2, is fed back using a quantization codebook known at both the 
transmitter and receiver, which consists of unit norm vectors of size 2 Bij . We assume each user has 
multiple quantization codebooks, with the codebook of size 2 Si ^ denoted as Cij = {c 1; c 2 , ■ ■ • , c 2 s^ }. 
Note that in practice the number of codebooks at each user may be limited to a small number, which 



is not considered in the paper. The quantized channel vector is hjj = argmax C( , gC - 



h »,j r 



. The 



random vector quantization (RVQ) codebook [20], [34] is used to facilitate the analysis, where each 
quantization vector is independently chosen from the isotropic distribution on the A^-dimensional unit 

h*„ , 



sphere. Denote cos 9. 



1,3 



'■J 



, and then with RVQ we have [34] 



(AT \ Bj j 

2Bl,J '^ri) > 1-2"^, (34) 

where (3(x,y) is the Beta function. 

We first derive the average achievable throughput for given B, t j, i,j = 1, 2. 

Proposition 3: The average achievable throughput for user i with training and digital feedback is 
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RiAFBiSl, S2) 



approximated as 

' (><U,,r,,U,. P d L Q , N t ) ( Si , si) = (BF, BF) 

^(k^^I^ ( Si ,si) = (BF,IC) 

7^ 3 )(/^P d L M ,^ - 1) (s i>sj ) = (IC,IC) 

ft«(«? i &, f P d L M , F*^, JV t - 1) ( Sf , sj) = (IC, BF) 

(35) 

where 7£.( 2 ) and 7£.( 3 ) are given by (TTOl ) and (TTTT) . respectively. 

Proof: See Appendix [Dj ■ 

This result can be used for adaptive transmission strategy selection at low SNR. In the following, 
we optimize training and digital feedback for ICIC. 

1) Training Optimization: Similar to (|25l >. we first get the following approximation for the rate loss 
due to training and digital feedback 

P. - Ri,dFB « log 2 ( 1 + L i;i P d ( - ^ + 2~^ ) ) , % = 1, 2. (36) 



With P d , P t — )■ 00 and = v, the rate loss is approximately log 2 ^1 + =- + L i - i Pd'2~ w t =x j , which 
grows with P rf for fixed Pjj. This shows that the system throughput is limited by the residual 
interference due to the quantization error. This effect can be eliminated if Bq is allowed to increase 
with the uplink transmission power. 

Then we can get the following approximation for the average achievable rate for user i (i — 1, 2) 

Pi,dFB « log 2 — . (37) 

P, 1 + (f t P t ) 1 + L hl 2~^ 

This is similar to (|26l) for analog feedback. Therefore, the optimal (P^, P f *) are also given in (|28l) . 

and they are independent of the feedback bit allocation in the uplink. 

2) Feedback Optimization: We assume each user can adaptively select the number of feedback 

bits and apply the corresponding quantization codebook for channel feedback for different BSs. The 

feedback optimization is based on the following approximation for the average SINR 

PdLii£n(Nt-i) 

SINR, « d * '- —, i = 1, 2, (38) 

1 + P d L a af-. + P d L a K a 2~^ 



which follows the distributions of signal and interference terms. Applying the bound in (|34[ >. the 
feedback bit allocation problem is formulated as 



1 _ 2 

(P*, P* ) = arg max — , i = 1, 2, (39) 

which is done individually at each user. This is an integer programming problem. To get an analytical 
solution, we will first relax the constraint that P,i and B» have to be integers. 
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Denote a = + p d T ''V'' , X = 2 nt- 1 , and x = 2 ^t- 1 . Then problem (|39T ) is reformulated as 

1 — X 

X* = arg max . (40) 

& x <x<i ^ + a 

The objective function f(x) = is concave, and set *^r- = 0, we have the solution x* = 



^ + ^ — ^ SL . Then the following solution will be used for feedback bits allocation 

ao ag ao ° 

^ = L-W - 1) log 2 x*\ , B*j = B t - l-(N t - 1) log 2 x*\ . (41) 

V. Numerical Results 

In this section, we present numerical results to verify our analysis and demonstrate the performance 
gain with training and feedback optimization. We focus on the 2-cell model in Fig. [H where the users 
are located on the line connecting the two BSs. We assume that the downlink and uplink transmissions 
have the same power constraint, i.e., P dl = P ul . The cell radius is R = 1 km, the pathloss exponent is 
a — 3, and N t = 4. As we do not consider training and feedback overhead optimization, the training 
and feedback intervals are fixed to be T t = N B N t and Tf b = N^N t , for N B = 2, respectively. 

A. Simulation vs. Approximation 

In Fig. HI we compare the derived approximations and simulations for the achievable throughput 
with training and digital feedback. Both approximation (|35l) and high-SNR approximation (|37|) are 
shown. For feedback bits B, we consider two scenarios: Fixed feedback bits: fiB = Tf b , in the 
simulation, we use p, = 1, which represents BPSK feedback, as in [27]; Varying feedback bits: the 
total number of feedback bits for user i is £>,; = L7/felog 2 (l + P ul L iti )\, i = 1,2, i.e., we assume 
perfect feedback at the rate of the uplink capacity, as in [26]. 

The accuracy of (|35l ) is shown in the fixed feedback case, where the high-SNR approximation (1371) 
appears to be a lower bound. With varying feedback bits, as simulation is of prohibitive complexity 
with large B, we only compare two approximations, and it shows the high-SNR approximation (1371) is 
accurate for this case. With varying feedback bits, the performance of ICIC with training and feedback 
approaches the perfect CSI case. However, in practice the number of feedback bits is normally fixed, 
and this will be the assumption in the following discussion. 

B. Training and Feedback Optimization 

In Fig. [51 we compare the following different systems: 

• Training + aFB I: analog feedback without optimization; 

• Training + aFB II: analog feedback with training optimization (1281) ; 

• Training + aFB III: analos feedback with training and feedback optimization (l28l). (l30l): 
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• Training + dFB I: digital feedback without optimization; 

• Training + dFB II: digital feedback with training optimization (|28l) ; 

• Training + dFB III: digital feedback with training and feedback optimization (l28l) . (l4Tj) . 
We see that training and feedback optimization provides significant performance gains. 

• With analog feedback, training optimization is more important, and additional uplink feedback 
power allocation provides limited performance gain. This is because the training power allocation 
is performed within T — T/& symbols, and P t ~ \/TP dl provides significant performance gain 
over equal power allocation; while the uplink feedback power allocation is within T/& symbols, 
and normally T > T fb . 

• With digital feedback, training optimization alone provides little performance gain, and the feed- 
back bit allocation is more important. This is because we assume a fixed number of feedback 
bits and the uplink is the limiting factor for the channel estimation accuracy, so the feedback 
optimization is more important. 

These observations indicate that we can focus on the downlink training optimization for analog 
feedback while the uplink feedback should be carefully designed when employing digital feedback. 

C. Average and Edge Throughput 

In Fig.[6l the average sum throughput and edge throughput, which is represented by the 5th percentile 
throughput, are compared for different systems with 2 users randomly located on the line connecting 2 
BSs. The adaptive ICIC system and the conventional single-cell beamforming system with perfect CSI 
are also shown for comparison. The system with and without training and feedback optimization are 
shown for both analog and digital feedbacks. Although these optimization procedures are developed for 
the high SNR regime, they will be used for all SNR values, as we have already observed the accuracy 
of the high-SNR approximation in Fig. |4] and we are more interested in the high-SNR regime where 
multicell processing provides higher performance gain. Adaptive transmission strategy selection is 
applied once the training and feedback are optimized. We can see that the analog feedback system 
provides performance close to the perfect CSI case, and the digital feedback system with B = Tfb 
is not as good but still provides significant gain over single-cell beamforming with perfect CSI. In 
addition, training and feedback optimization improves the performance of analog and digital feedback. 

VI. Conclusions 

This paper investigated an intercell interference cancellation (ICIC) system while specifically ac- 
counting for and optimizing the necessary channel training and feedback. A main conclusion is that 
despite recent pessimism, ICIC with practical CSI assumptions can efficiently combat intercell interfer- 
ence with adaptive strategies: at low SNR adaptive transmission stratesv switching is reauired. while 
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at high SNR training and feedback optimization can greatly improve the performance. Considering 
that ICIC poses light requirements on backhaul capacity, it appears to be a practical solution for 
interference mitigation in multicell networks. Although our analysis was limited to two cells to retain 
tractability, the results in the paper provide guidelines and a starting point for designing multicell 
processing systems, with many users per cell, including possibly cells that are highly heterogeneous 
in terms of coverage areas and layout [35], [36]. 

There are further key issues that require more investigation. For example, the uplink feedback channel 
is assumed to be an unfaded AWGN channel for analog feedback and as an error-free channel without 
delay for digital feedback. More realistic feedback channels should be considered in the future, such as 
the effect of fading, feedback error and delay. In some cases, where the delay is a significant fraction 
of (or even exceeds) the channel coherence time, additional feedback rate or training optimization 
may prove irrelevant. An alternative approach in that case might be to consider the recently proposed 
"retrospective" approaches to exploiting delayed CSI feedback [37], [38], which are robust to feedback 
delay and have a possibly large regime of utility [39]. 



Appendix 

A. Proof of Proposition [7] 

With CSI training, the signal power of user i (i = 1, 2) can be approximated as 



(a) 



PdLi 



(6) 



PdLi 



h* f 



PdLi^i^Nt 



BF 



PdLi^iKf i 'X^^ Nt _ l ^ Si — IC 

where step (a) follows ©, in (b) we ignore the estimation error term which is normally very small 
as its variance is inverse to the pilot power, and step (c) follows the results for the perfect CSI case 
and hi i ~ CJ\f(0, /^jIatJ. The interference term for S| = IC is 



PrlLi i h*7f; 



PdLi 



%.%} ^l.Sf 



(d) 



PdLiMljfi \ 2 ~ PaLaaltxl, 



where step (d) follows the design rule of the precoder. If s~i = BF, then PdL^ h.*^ Sj ~ P d L ij ix|. 
Therefore, both the signal power and interference power are chi-square random variables. Applying 
LemmaUl the average achievable throughput of user i with estimated CSI is approximated as in (fPZl) . 



B. Proof of Lemma \2\ 

The average throughput of user i is approximated by i?j t ~ E 

, so 



log 2 1 + 



P d , P t -> oo and g 



v, we have k? . -»• 1 and P d a 2 a -> = , 



1 + 



PdL 



d Lf i,iX2(N t -l) 

which sives the desired result. 



E 



log 2 



PdLi : iXl(N t -l) 



IL — E 



log 2 ( 1 + =-x 2 



. With 



, (42) 
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C. Proof of Lemma \3\ 



Denote :r = \/ 1 — T t /T, a = \og(LaP dl e^^ Nt ^), the optimization problem can be rewritten as 

_ f( x ) = ax 2 - 2x 2 log (x + l/Vf) . (43) 



max 



0<x<y/l-N B /T 

To obtain the maximum at T t = N B , or x = \Jl — N B /T, a sufficient condition is |£ > for 

< x < \Jl - N B /T. First, we have = 2ax ^2d_ - Ax log (x + . Set it to be greater 

than 0, we get 



a > 



x + 1/ 



^-^ + 2 log (x + l/Vr) ±g(x). 
INT v > 



(44) 



As the right-hand side is an increasing function of x, and < x < — a sufficient condition 
for (HU) to hold is a > g \ Jl - N B /Tj , which gives the result in (l20t . 

D. Proof of Proposition \3\ 

With training and digital feedback, the precoders are designed based on the quantized CSI hjj, 
j = 1, 2. The signal power for user i is 



PriLi i I h„- .•£... I 



PdLi ACh-i i ~\~ 6j j) fi.Si I 



(a) 



P,iL, , h, , • 



(6) 



-Pd-^i,i||hj 



^cos^jh^ + sin^g^)* ■ f is 



BF 



where step (a) ignores the estimation error vector which is small for large P t , in step (b) the estimated 
channel direction is decomposed into the quantized direction and its orthogonal direction g^, step 
(c) ignores the quantization error term, as sin 6^ = 0(9 iti ) <C 1 for reasonable B iti , and approximates 
cos Q\ j by its expectation, and step (d) follows the results for the perfect CSI case and the fact 
h M ~ CJ\f(0, KijI Nt ). 



If = BF, the interference power is Pd^i 



PrfLjjX2' otherwise, for = IC, 



PdLi : i\h*jii^ \ 2 — PdLii 



' 1,1/ 1,S-; 



(e) 



+ P d L Q |e*^ jS ,| £ P d L if5 Kj 5 • 2"*^ + PdL ., a 1^ 



(45) 



where step (e) ignores the term with both hjj and e { j, and step (f) follows the result in [23] that 



the residual interference term due to quantization error, 



i,i l ' s i 



is an exponential random variable. 
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The two terms in (1451) correspond to the residual interference due to the quantization error and the 
estimation error, respectively. 

Based on the above results and applying Lemma we get (1351) . 
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Fig. 1. A two-cell network. Each BS is serving a home user, which is suffering interference from the neighboring BS. We mainly 
consider the scenario where both users are on the line connecting BS 1 and BS 2, with three typical cases shown in the right figure. 
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Fig. 2. Simulation and calculation results for the 2-cell network with perfect CSI and CSI training, a = 3, Nt = 4. User 1 is at 
(-.LR,0), edge SNR 10 dB, a = 3, N t = 4. 
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Fig. 3. Operating regions for different transmission strategy pairs, a — 3, Nt = 4, edge SNR 4 dB, user 1 and user 2 are on the line 
connecting BS 1 and BS 2, 'x': (si,s 2 ) = (BF,IC); 'o': (fli,s 2 ) = (IC,BF); '+': (si,s 2 ) = (BF, BF); (si,s 2 ) = (IC,IC). 
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Fig. 4. Simulation and approximation for ICIC with training and digital feedback, T — 500. User 1 is at (— 0.1R, 0), and user 2 is 
at (0.LR,0). 
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Fig. 5. Achievable sum rates for different systems, with edge SNR 15 dB, and B — Tft, for digital feedback. User 1 is at (— 0.17?, 0) 
and user 2 is at (O.li?, 0). 
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Fig. 6. Performance comparison of different systems, with T = 500, and B = Tfb for digital feedback. "aFB" denotes the system 
with training and analog feedback, while "dFB" denotes the system with training and digital feedback. Training and feedback overheads 
are considered. 



